Data Mining Consulting Improve Data Quality
نویسندگان
چکیده
منابع مشابه
Data Mining Consulting Improve Data Quality
Data are important for making decisions. However, the quality of the data affects the quality of decisions. Data mining as one of the most important sources of knowledge needs high quality data to mine, but there are not enough good quality data in many enterprises. By analyzing the reasons for low data quality systematically, a new method called data mining consulting for improving data qualit...
متن کاملThe Big Data mining to improve medical diagnostics quality
The paper offers a method of the big data mining to solve problems of identification of cause-and-effect relationships in changing diagnostic information on medical images with different kinds of diseases. As integrated indices of the fundus vessels and coronary heart blood vessels we have used a global set of geometric features which is supposed to be a rather complete characteristic of diagno...
متن کاملData Quality Mining
In this paper we introduce data quality mining (DQM) as a new and promising data mining approach from the academic and the business point of view. The goal of DQM is to employ data mining methods in order to detect, quantify, explain and correct data quality deficiencies in very large databases. Data quality is crucial for many applications of knowledge discovery in databases (KDD). So a typica...
متن کاملA Data Mining Approach to Improve the Automated Quality of Data
The quality of the data in organisational information systems is a critical issue for organisations. The rapid growth in the volume and complexities of technologies related to databases and data warehouses has enabled executives and decision makers to more readily store, access, analyse and retrieve massive amounts of information to support business needs. For the most part, advantages, such as...
متن کاملEnhancing Learning from Imbalanced Classes via Data Preprocessing: A Data-Driven Application in Metabolomics Data Mining
This paper presents a data mining application in metabolomics. It aims at building an enhanced machine learning classifier that can be used for diagnosing cachexia syndrome and identifying its involved biomarkers. To achieve this goal, a data-driven analysis is carried out using a public dataset consisting of 1H-NMR metabolite profile. This dataset suffers from the problem of imbalanced classes...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Data Science Journal
سال: 2007
ISSN: 1683-1470
DOI: 10.2481/dsj.6.s658